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Abstract 

In the search of appropriate riemannian metrics on quantum state space the concept of statistical 
monotonicity, or contraction under coarse graining, has been proposed by Chentsov. The metrics with 
this property have been classified by Petz. All the elements of this family of geometries can be seen as 
quantum analogues of Fisher information. Although there exists a number of general theorems sheding 
light on this subject, many natural questions, also stemming from applications, are still open. In this 
paper we discuss a particular member of the family, the Wigner-Yanase information. Using a well-known 
approach that mimics the classical pull-back approach to Fisher information, we are able to give explicit 
formulae for the geodesic distance, the geodesic path, the sectional and scalar curvatures associated to 
Wigner-Yanase information. Moreover we show that this is the only monotone metric for which such an 
approach is possible. 

1 Introduction 

The notion of information proposed by Fisher is fundamental in probability and statistics for a number 
of reasons; here we mention only the Cramer-Rao inequality and the asymptotic behaviour of maximum 
lilkelihood estimators for exponential models (one can see [5] for unexpected features and applications of 
Fisher information). In classical statistics Rao was the first to point out that Fisher information can be seen 
as a riemannian metric on the space of probability densities. This point of view was nicely complemented by 
the results of Chentsov saying that (on the simplex of probability vectors) Fisher information is the unique 
riemannian metric contracting under Markov morphisms. This can be rephrased in a more suggestive way. 
Markov morphisms, or positive mappings, are the mathematical counterpart of the notion of noise. Now 
suppose that we want to use a distance to distinguish different states (probability densities) in a statistically 
relevant way. Then the effect of noise must be that of contracting the metric. Chentsov theorem says therefore 
that in the classical case there is only one choice, the Fisher information (another argument producing Fisher 
information can be found in [43]). 

In the quantum case one deals with density operators instead of density vectors and completely positive 
mappings play the role of Markov morphisms. As often happens in the quantum counterpart of a classical 
theory, instead of a uniqueness result, one has a classification theorem, due to Petz. This result states that 
there is bijection between statistically monotone metrics on quantum state space and the operator monotone 
functions: we have therefore a rich "garden" of candidates for the role of Fisher information in quantum 
physics. Among the elements of this family of metrics one can find, in a certain sense, the most relevant 
riemannian metrics appeared in the literature [37]. 
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Despite the existence of general results for the theory [31, 13, 17, 28, 29, 27, 19] a number of open 
problems resists to investigation. For example it does not exist yet a general formula for the geodesic path 
and the geodesic distance associated to an arbitrary monotone metric. For the use of this kind of distances 
see for example [34] . Because of the absence of a general formula, inequalities (giving bounds for the geodesic 
distance) have been proved [41]. 

In this paper we discuss the Wigner-Yanase skew information. To find the formulae for geodesic path and 
geodesic distance we mimic the classical approach to Fisher information via sphere geometry (one should 
note the importance of determining geodesic path in the study of the 2-Wasserstein metric [6]). Indeed 
Wigner-Yanase information appears as the pull-back of the square root map [18]. Next we prove the formula 
for the scalar curvature. One proof, due to .J.Dittmann, uses the general formula [13] and requires a long 
calculation. The second one just uses the pull-back approach. One should emphasize that, since the scalar 
curvature determines the asymptotic behaviour of the volume (for a riemannian metric) then it has also a 
statistical meaning in relation to the quantum analogue of Jeffrey's rule for determining prior probability 
distributions (see [37]). Finally we prove, as a corollary of the results in [26, 27, 19] that the Wigner-Yanase 
information is the only monotone metric that can be seen as a pull-back metric. 

The paper is organised as follows. In section II we review the geometric approach to Fisher information. 
In section III one finds an introduction to the general theory of statistical monotone metrics. Section IV 
shows how the Wigner-Yanase information can be seen as a monotone riemannian metric. In section V 
we show that the Wigner-Yanase geometry can be seen as the sphere geometry transposed on the space 
of density matrices; moreover we characterise it as the unique pull-back metric. Section VI contains some 
comments on the main results and on some open problems. 



2 Fisher information and its geometry 



The classical definition of Fisher information for an indexed family of densities p$ is given by the variance 
of the score. In the case of a family indexed by only one parameter 6 it is the number 



I{0) = 



^logp. 



(2.1) 



assigned to the parameter 9. For n parameters, say 9 = (0^, . . . , 6*") , it is a matrix defined on the parameter 
manifold given by 



mi 



J log po 



d 



logpe 



(2.2) 



Geometrically this means, that I{9) is a symmetric bilinear form on the tangent spaces of the parameter 
manifold. In a coordinate free language it reads as 



I{9){U,V) = Ee [U{\ogpe) F(logpe)], 



(2.3) 



where U and V are vectors tangent to the parameter manifold and U {logpe) is the derivative of logp0 along 
the direction U, that means U (logpe) = ^ logpe+tf/it-g- 

1(9) is a measure for the statistical distinguishabilty of distribution parameters. Under certain regularity 
conditions for 9 ^ pQ the image of this mapping is a manifold of distributions. This manifold is the actual 
object of interest in information geometry rather than the space of distribution parameters and formula 
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(2.3) defines a Riemannian metric g on it (for a general reference see [1]). Indeed, a vector u tangent to this 
manifold is of the form 

d 



u = -TiPe+tu 
di |t=o 



and the right hand side of (2.3) now reads as 



g{u,v) := Ep 



u V 
P P. 



(2.4) 



defining the Fisher metric on the manifold of densities. If the differential of i-^ pg is not injective, than 
there is some parameter redundance or ambiguity in the choice of U and V , and therefore the right hand 
side of (2.3) does not depend on this choice. 

We restrict now to C M", the simplex of strictly positive probability vectors, that is := {/O e 
K" : J27=i Pi ~ ^^Pi ^ 0, i = 1, . . . , n}. An element p € is a density on the n-point set {1, . . . , n} with 
p{i) = Pi. We regard an element u of the tangent space TpT^ = {u e M" : J27=i ^» = 0} as a function u on 
{1, . . . , n} with u{i) = Ui. 

Definition 2.1. The Fisher-Rao Riemannian metric on TpCP„ is given by 

n 

i=i 

for u, V & TpT„. 

To see the relation between this metric and the Fisher metric, let u,v & T^p'^n- We obtain from (2.4) 

Sr^ U{l) V{l) UiVi 

i=i i=i P' 

in accordance with (2.5). 

The following result is well known and is a very special case of a far more general situation (see [15] for 
example). 

Theorem 2.2. The manifold equipped with the Fisher-Rao Riemannian metric {■, is isometric with 
an open subset of the sphere of radius 2 inW^. 

Proof. We consider the mapping : CP^ — > ^2 ~^ C M", 

V{P) ■■= 2 is/pi, s/p^) . 
Then Dpifiu) = ( J=, . . . , ^) and we get 

n 

I^p'P ((•. •)^) (■". ■= {Dpf{u), Dpip{v))^" = ^ = {u, -y) J . 

i=i P' 

Hence the standard metric on the sphere of radius 2 is pulled back to the Fisher- Rao Riemannian metric. □ 
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This identification of CPn witli an open subset of a radius 2 sphere aUows for obtaining differential geo- 
metrical quantities of the Riemannian manifold (•, •)^). From the very definition of geodesic distance, 
geodesic path and scalar curvature, one has for with Pi,P2 S S"~^, 

1) geodesic distance 

d{Pi,P2) = r ■ arcos f 

2) geodesic path connecting Pi and P2 : 

^ ^' \\{l-t)Pl+tP2\\ 

(of course, t is not the arc length parameter); 

3) scalar curvature 

Scal(t;) = ^(n- l)(n-2) 

because 5""^ has constant sectional curvature equal to 

Let us denote hy dF,jF, Scalp respectively the corresponding quantities for the Fisher information. The 

above considerations give, for p,(J & 
1) Bhattacharya distance 

dY{p,cF) = 2arccos p]'"^ cr]'"^ 



2) geodesic path connecting p and a: 

3) scalar curvature 



ScalF(p) = i(n - l)(n - 2) Vp e T„. 

The Levi-Civita connection associated to Fisher metric can be decomposed using the geometry of mixture 
and exponential models. The rest of the section explains how. 

Definition 2.3. A dualistic structure on a manifold M is a triple ((•,•), V,V) where (•,•) is a riemannian 
metric on M and V, V are affine connections on M such that 

X{Y, Z) = {VxY, Z) + (y, VxZ) 

where X,Y,Z are vector fields. If U^,U^ are the parallel transport associated to V,V then the above 
equation is equivalent to 

{U''{u),U^{v))^{u,v). 

A divergence on a manifold is a smooth nonnegative function £) : M x M ^ M such that D{p, cr) = iff 
p = a. To each divergence D one may associate a dualistic structure ((•, •), V, V) (see [1, 14]). 
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Let be the Lcvi-Civita connection of Fisher information. The KuUback-Leibler relative entropy 
K{p,a) = X]jPi(logpi — log CTj) gives a dualistic structure ((•, •)'^, V™, V^) such that 

where V™, V® are the mixture and exponential connections. These connections are torsion free and flat: 
once the representation by scores is used for the tangent spaces, the associated parallel transports are given 
by 

The geodesies of V", V are, respectively, the mixture and exponential models. 

3 Metric contraction under coarse graining 

In the commutative case a Markov morphism (or stochastic map) is a positive operator T : R" — > R'°. In the 
noncommutative case a stochastic map is a completely positive and trace preserving operator T : Mn Mk 
where M„ denotes the space of n by n complex matrices. We shall denote by 2)„ the manifold of strictly 
positive elements of M„ and by C D„ the submanifold of density matrices. 

In the commutative case a monotone metric is a family of riemannian metrics g = {g"} on {?„}, n e N 
such that 

gJi,\^^{TX,TX)<g;{X,X) 

holds for every stochastic mapping T : M" — > and all p G^n and X € TpT^. 

In perfect analogy, a monotone metric in the noncommutative case is a family of Riemannian metrics 
g = {g"-} on {Dl}, n G N such that 

gjp^^^{TX,TX)<g;{X,X) 

holds for every stochastic mapping T : Af„ Mm and all p E and X E TpD\. 

Let us recall that a function / : (0, oo) ^ M is called operator monotone if for any n G N, any A, B G Mn 
such that < ^ < B, the inequalities < f{A) < ,f{B) hold. An operator monotone function is said 
symmetric if f{x) :— xf{x~^) and normalized if /(I) = 1. In what follows by operator monotone we moan 
normalised symmetric operator monotone. With each operator monotone function / one associates also the 
so-called Chentsov-Morotzova function 



Cf{x,y) := 




x,y>0. 



Define Lp{A) := pA, and Rp{A) := Ap. Since Lp,Rp commute we may define c{Lp,Rp). Now we can state 
the fundamental theorems about monotone metrics (uniqueness and classification are up to scalars). 

Theorem 3.1. [7] There exists a unique monotone metric on given by the Fisher information. 

Theorem 3.2. [36] There exists a bijective correspondence between monotone metrics on and operator 

monotone functions given by the formula 



{AB)^j := Tr{A-cf{Lp,Rp){B)). 
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The tangent space to at p is given by TpD^ = {A e M,, : A = A*,Tr{A) ^ 0}, and can be 
decomposed as TpD^ = {TpDly © {TpDl)", where {TpTil^Y := {A e TpDi : [A,p] = 0}, and (T^Di)" is the 
orthogonal complement of {TpDj^y, with respect to the Hilbert-Schmidt scalar product {A,B) := Tr{A*B). 
Each statistically monotone metric has a unique expression (up to a constant) given by Tr{p~^A'^), for 
A G (TpDl^y. The following result will be used in Section 5. 

Proposition 3.3. (See [3,]). Let A G T^D^ be decomposed as A = A" + i[p, U] where A" G {TpV^y and 
i[p, U] e (TpTii)". Suppose cp G 6^(0, +oo). Then 

iDpip)iA)^ip'{p)A^ + t[^ip),U]. 

As proved by Lesniewski and Ruskai each monotone metric is the hessian of a suitable relative entropy; 
to state this result more precisely, we introduce some notation. In what follows g is an operator convex 
function defined on (0,+oo) and such that g{l) = 0. The formula 



g{x) + xg{x-^) 



associates a normalised, symmetric operator monotone function / = to each g. We denote by A^.^ = 
L„R~^ the relative modular operator. The relative ^r-entropy of p and a is defined as 

Hg{p,a) ■.= Tr{p^g{^,,p){p^)). 

Hg is a divergence on 2)„ in the sense of [14, 1]. If p, a are diagonal Hg, reduces to the commutative relative 
5-entropy (see [9]). 

Theorem 3.4. [31] Let g be operator convex, g{l) = 0, f = fg and p G D„. Then 

d d 



dt ds 



Hg{p + tA,p + sB) = Tr{A ■ Cf{Lp, Rp){B)). 



t=s=0 



To state the general formula for the scalar curvature of a monotone metric we need some auxiliary 
functions. In what follows d, (logc)' denote derivatives with respect to the first variable, and c = Cf. 



hi{x,y,z) 
h2{x,y,z) 
h3{x,y,z) 



c{x,y) - zc{x,z) c{y,z) 
(x- z){y- z)c{x,z)c{y,z) ' 

ic{x,z) -c{y,z)f 
{x - yYc{x,y)c{x, z)c{y, z) ' 
{\nc)\z,x)-{\nc)'{z,y) 



x-y 

hi{x,y,z) := z{[nc)' {z,x) {inc)' {z,y) , 

h := hi-^h2 + 2/i3 - /14 . (3.1) 

The functions hi have no essential singularities if arguments coincide. 

Note that (^4, B)^ := Tr{A-Cf{Lp, Rp)[B)) defines a riemannian metric also over !D„ (D^ is a submanifold 

of codimension 1). Let Scal/(p) be the scalar curvature of (!)„,(•, •)^) at p and Scalj(p) be the scalar 

curvature of (D;^, (•, •)^). 
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Theorem 3.5. [13] Let a{p) be the spectrum of p. Then 



Scalfip) = X] Kx,y,z)- ^ h{x,x,x) (3.2) 



Scal){p) = Scalfip) + - - 2)- 



4' 

4 Wigner-Yanase information as a riemannian metric 

Let p G Dl^ be a density matrix and let ^ be a self adjoint matrix. The Wigner-Yanase information (or skew 
information, information content relative to A) was defined as 

I{p,A} :=~Tr([pi/2,^]2) 

where [•, •] denotes the commutator(see [42]). Consider now gix) := gwyix) ■= 4(1 — y/x). In this case 

F,(p,<7)=4(l-Tr(p5^5)). 
The associated operator monotone and Chentsov-Morotzova functions are 

/wy(.) := \iV^+l)' cwy(x,,) = 

Let us consider the monotone metric 

{A,B)jy:= Tr(Acwy(L„i?p)(B)). 
A typical element of iTpDn)° has the form i [p, A], where A is self-adjoint. We have 

(i [p, A],i [p, A])jy = Tr (i [p, AjAiLy' + Ry^)-^ii [p, A])) 

= -4Tr (iLy^ + Ry^)-\[p,A]) iiy^ + Ry^)-\[p,A])) 

= -4Tr (iiy^ + i?y2)-i o iL, - R,)iA) (Ly2 + ^i/2)-i o iLp - Rp)iA)) 

= -4TV [iLy- - Ry-)iA) ity- - Ry-)iA)) 

= -\^A{p'l\Af\^Mip,A) 



and this explains why the monotone metric associated with the function j(-\/x -|- 1)^ is called the Wigner- 
Yanase monotone metric. 

5 The main result 

First of all we calculate the scalar curvature of Wigner-Yanase information using Theorem 3.5. If /wy(a;) := 
\i\fx -h 1)^ we write Scal^ for Scalj. 
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Theorem 5.1. 

Scal]^{p) = \{n''-l){n'-2). 
Proof. Let us calculate the auxiliary functions for cwy{x,y) :— ^{_\fx + ^Jy)~'^ . We get 



hi{x,y,z) = 

h2{x,y,z) = 

h3{x,y,z) = 

hi{x,y,z) = 



4 ^/yf (v^ + yi) (x/y + 

^{^+^zf{^+^f 



1 



Now one can verify by calculation that the symmetrization of /ii — i /12 and the symmetrization of 2 ^,3 — /14 
vanish. Hence, by (3.1), the symmetrization of h vanishes, too. Since we sum up in formula (3.2) over all 
triples of eigenvalues we may replace h with its symmetrization without changing the summation result. 
Therefore 

Scalwy(p) = , Scal^y(p) = ^(n^ _ i)^n' - 2) Vp e Dj, . 

□ 

In what follows we use the pull-back approach to derive (and explain) the above formula in a direct way. 
Furthermore we deduce the geodesic distance and geodesic equation. 

Let us denote by S the manifold {A e M„ : Ti AA* =2,A = A*}. Clearly, since S is the intersection of 
the radius 2 sphere in C"^" and the subspace of Hermitian matrices, it is isometric with a radius 2 sphere 

sf-\ 

Now, let : ^ S C C"^", fp{p) := 2^. Then we have the following result (see [26, 18, 28, 21]). 

2_i 

Theorem 5.2. The pull-hack by the map ip of the natural metric on S = S2 coincides with the Wigner- 
Yanase monotone metric. 

Proof. Let A and B be vectors tangent to at p. Because ip{p) (p{p) = 4p wc get from the Leibniz rule 
Dpip{A)y/p + ^Dp(f{A) = 2 A Thus, the differential of at the point p is given by 

DMA) = 2[Ll/^-^Rl/^y\A). 

Therefore the pull-back of the real part of the Hilbert-Schmidt metric yields 
Dp<p{Re {; ■)){A, B) = Re {Dp^{A), Dp^{B)) 

= 4Re((Ly2 + i?y2)-i(^), + Ry^)-\B)) 

= A{A,{Ly^ + R}J^r\B)) 
= 4TVA(Ly2 + iji/2)-2(5) 

= TrAc^yiLp^R^m = {A,B)jy 
which was to be proved. □ 
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Prom this result one can deduce the following 

Theorem 5.3. For the geodesic distance, geodesic path and the scalar curvature of Wigner-Yanase infor- 
mation the following formulae hold 
1) geodesic distance 



dwy{p,(T) = 2arccos{Tr{p^''^a^''^)) (5.1) 



2) geodesic path 



^^y^*)-\r{{{i-t)^p + tv^r) ^^■^> 

Scalljy{p) = ^{n^-l){n^-2). (5.3) 

Proof. The formulae are immediate consequences of the preceding theorem and of sphere geometry. Indeed 

by the pull-back argument the Wigner-Yanase metric looks locally like a sphere of radius 2 of dimension 
(n^ — 1)- But for a sphere of this kind the sectional curvatures are all equal to j and therefore the scalar 
curvature is given by |(n^ — l)(n^ — 2). □ 

One may ask if other monotone metrics are the pull-back of some function tp different from the square 
root. The rest of the section answers this question. 

Definition 5.4. A monotone metric (•, ■)pj is a pull-back metric if there exists a manifold S C M„ and a 
function ip G 6^(0, +00) such that the pull-back metric of (p : — > S C M„ coincides with (•, ■)pj. 

Proposition 5.5. Let {■,-)p.f be a monotone m,(-:tric. let c = Cf be the associated CM-function and let 
€ G^{0, -|-oo). We have that {■, ■)pj is a pull-back metric by (p if and only if 

ip{x) - ip{y) \ ^ 

J =c(x,2/). (5.4) 

Proof. Apply the formula (3.3) to tangent vectors in (T^D^)". □ 

Definition 5.6. Let v'l X € 6^(0, -|-oo). We say that {ip, x) is a dual pair if there exist an operator monotone 
/ such that 

ip{x) - ip{y) x{x) - x{y) 



3) scalar curvature 



X — y X — y 

where c = Cf is the CM-function associated to /. 



= c{x,y). (5.5) 



In such a case we say that / (or Cf) is a dual function. If {fjip) is a dual pair with respect to / (or c/) 
we say that / (or c/) is self-dual. Obviously one has 

Proposition 5.7. To say that {■,-)pj is a pull-back metric by ip it is equivalent to say that f (or Cf) is 

self-dual with respect to {p. 



Wigner-Yanase information on quantum state space 



10 



Definition 5.8. Two dual pairs {(fi, x), {'fi, X) are equivalent if there exist constants Ai,A2, -81,-62 such that 
A1A2 = 1 

= AiLp + Bi 
X = A2X + B2. 

Obviously equivalent pairs define the same CM-function. In what follows we consider dual pairs up to 
this equivalence relation with the traditional abuse of language. We are ready to state the fundamental 
result of the theory that classifies dual pairs. 

Theorem 5.9. ([26, 27, 19]) Let (p,x ^ 6^(0, +00). Then {ip,x) is a dual pair if and only if one of the 
following two possibilities hold 

(^(x),x(x)) = (^,^) pe[-i,2]\{o,i} 

{ip{x),x{x)) = (a;,log(a;)). 

Corollary 5.10. The function f{x) = \{\/x + 1)^ is the only self-dual operator monotone function, that is: 
the Wigner-Yanase metric is the only pull-hack metric among statistically monotone metrics. 

6 Conclusions 

Remark 6.1. Note that the formula (5.1) implies d'^y{p,a-) < 2tt. An analogous inequality holds for the 
Bures metric (see [10], p. 311). It seems that in the literature there are no other explicit formulas for the 
geodesies distance. For example it is known that the formula 

d^{p, a) = ^2-2Tr(p^ap^)5 (6.1) 

defines a metric on the state space whose infinitesimal counterpart (say the hessian) is the S'L-D-metric (that 
is f{x) = ^(1 + x)). But this docs not imply that Equation (6.1) is the geodesic distance of the S'-Li^-metric. 

Remark 6.2. In general it is difficult to give explicit formulae for geodesic paths of monotone metrics. In 
the case of the Bures metric these geodesies can be given because they are projections of large circles on a 
sphere in the purifying space (see [10] p. 311 and [12] [4]). For a discussion of geodesies for a-connections see 
[28, 29]. 

Remark 6.3. A classical theorem classifies the spaces of costant curvature [30]. It is not known at the moment 
if there are other monotone metrics of costant sectional and scalar curvature. 

Remark 6.4. We have seen in the commutative case that for the Levi-Civita connection of the pull-back of 
the square root it is available the decomposition 

In the non-commutative case an analogous decomposition for the pull-back of the square root no longer 
holds. Indeed, on one hand, the use of Umegaki relative entropy H{p, a) — Tr{p{\ogp — logcr)) produces a 
similar decomposition, but for the Bogoliubov-Kubo-Mori metric [33, 1, 23]. On the other hand, if one uses 
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H^y{p,a) = 4(1 — Tr(p-'^/^(T^/^)) as a divergence on D\ and constructs the associated dualistic structure 
((■) ■)^^^) V^^y , V'^^y) (again following the lines of [14, 1]), then the construction is trivial, namely the 
dual connections both coincide with the Levi-Civita connection of the Wigner-Yanase information. This is 
easily seen on where Hg{p,a) reduces to Csiszar relative g-entropy: it is known that such an entropy 
induces the a-geometry where a is given by the formula a = 3+2g"' {1) / g" (1) (see [1] p. 57). For g = 4(1— y^) 
this gives a = that is the Fisher information case (see also [21]). 
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